🎯 Reinforcement Learning - Scourface · Scour

Reinforcement Learning from Human Feedback

arxiv.org·17h

🎯Predictive Coding

Hybrid neural–cognitive models reveal how memory shapes human reward learning

nature.com·22h

🎯Predictive Coding

Quantization-Aware Distillation

ternarysearch.blogspot.com·5h·

Discuss: Hacker News

🔄Meta-Learning

On Computation and Reinforcement Learning

arxiv.org·2d

🎯Predictive Coding

Hybrid Model‑Based / Model‑Free Reinforcement Learning for Energy‑Efficient Autonomous Warehouse Robot Navigation with Real‑Time Obstacle Prediction

freederia.com·2d

Learning Models with Uniform Performance via Distributionally Robust Optimization

dev.to·18h·

Discuss: DEV

🎯Predictive Coding

Deep reinforcement learning-based energy scheduling for green buildings with stationary and EV batteries of heterogeneous characteristics

sciencedirect.com·1d

🧠Neuromorphic Computing

Continual learning and the post monolith AI era

baseten.co·1d·

Discuss: Hacker News

🧠Neuromorphic Hardware

Part 5: Reward Engineering: How to Shape Behaviors in Financial/Robotic Tasks

dev.to·2d·

Discuss: DEV

🎯Predictive Coding

Why reinforcement learning breaks at scale, and how a new method fixes it

techxplore.com·3d

🧠Neuromorphic Hardware

Performance Tip of the Week #94: Decision making in a data-imperfect world

abseil.io·9h

🎯Predictive Coding

i10e-lab/HelloRL: A fully modular framework to make Reinforcement Learning quick and easy

github.com·1d·

Discuss: Hacker News

🔄Meta-Learning

Physics-Informed Neural Networks for Inverse PDE Problems

pub.towardsai.net·15h

🤖Machine Learning

Personalized Adaptive Feedback System for Early Detection and Intervention of Fine‑Motor Skill Development in Preschool Children Using Wearable IMU Sensors and Reinforcement Learning

freederia.com·2d

🔄Meta-Learning

Hypernetworks: Neural Networks for Hierarchical Data

blog.sturdystatistics.com·2d·

Discuss: Hacker News

🎯Predictive Coding

lonestation.itch.io·13h

Barn Owls Know When to Wait (iuSTDP part 2)

blog.typeobject.com·10h·

Discuss: Hacker News

🧠Neuromorphic Hardware

On Economics of A(S)I Agents

lesswrong.com·12h

🧠Neuromorphic Hardware

learning by reverse engineering

clymup.com·17h

🔄Meta-Learning

Exploiting large language model with reinforcement learning for generative job recommendations

eurekalert.org·2d

🎯Predictive Coding
